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(57) Abstract 

An isolated gene cluster of Micromonospora echinospora which codes for calicheamicin biosynthesis. The biosynthetic gene cluster 
contains genes encoding for proteins and enzymes used in the biosynthetic production of calicheamicin, including the aryltetrasaccharide 
and aglycone. The gene cluster also includes the gene conferring calicheamicin resistance. The invention also provides isolated genes of 
the biosynthetic cluster and their corresponding proteins. In addition, the invention relates to DNA hybridizing with the calicheamicin gene 
cluster and the isolated genes of that cluster. Expression vectors containing genes of the biosynthetic gene and their functional variants are 
also provided. The invention also relates to host cells conjugated with DNA isolated from the Micromonospora echinospora spp. calichensis 
genome. 
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Micromonospora echinospora genes encoding for biosynthesis of 
calicheamicin and self-resistance thereto 



5 This application claims benefit from provisional application 60/1 1 K325 filed on 

December 7, 1998, which application is incorporated herein by reference in its entirety. 

Field of the Invention 

The present invention relates to a biosynthetic gene cluster of Micromonospora 
10 echinospora spp. calichensis. In particular, the calicheamicin biosynthetic gene cluster 

contains genes encoding for proteins and enzymes used in the biosynthetic pathway and 
construction of calicheamicin" s aryltetrasaccharide and aglycone, and the gene conferring 
calicheamicin resistance. The present invention also relates to isolated genes of the 
biosynthetic cluster and their corresponding proteins. In addition, the invention relates to 
15 DNA hybridizing with the calicheamicin gene cluster and the isolated genes of that 

cluster. The invention also relates to expression vectors containing the biosynthetic gene 
cluster, the individual genes, or functional variants thereof 

Background of the Invention 

20 The enediyne antibiotics, which were discovered in the 1980's, have long been 

appreciated for their novel molecular architecture, their remarkable biological activity, 
and their fascinating mode of action. Enediyne antibiotics were originally derived by 
fermentation of microorganisms, including Micromonospora, Actinomedura, and 
Streptomyces. Rothstein, D. M, Enediyne Antibiotics as Antitumor Agents, p. 2 (1995). 
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As a class, the enediyne antibiotics have been referred to as the most potent and highly 
active antitumor reagents yet discovered. Rothstein, D. M. Enediyne Antibiotics as 
Antitumor Agents, preface (1995). 

To date, at least twelve members of this family of antibiotics have been 
5 discovered, all of which fall rpughly into two categories. The first category of enediynes 

is classified as chromoprotein enediynes because they possess a novel 9-membered ring 
chromophore core structure, which also requires a specific associated protein for 
chrompphore stabilization, The second category of enediyne is classified as non- 
chromoprptein enediynes. These enediynes contain a 10-membered ring, which requires 

10 no additional stabilization factors. This enediyne ring structure is often referred to as the 

"warhead." The warhead induces DNA damage, which is frequently a double-stranded 
cleavage and appears to. be irreparable. This type of DNA damage is usually 
nonrepayable for the cell and, is most often lethal. Because of these remarkable chemical 
and biological properties, there has been an intense effort by both the pharmaceutical 

15 industry and academia to study, these substances with the goal of developing new and 

clinically useful therapeutic anti-turnor agents. 

The 9-membered ring chromoprotein enediyne subfamily is comprised of: 
neocarzinostatin from Streptomyces carzinostaticus,(Mytrs, A.G., et al., J. Am. Chem. 
Soc, 110, 7212-7214 (1988)); kedarcidin from Actinomycete L585-6, (Leet, J.E., et al., J. 

20 Am. Chem. Soc. ,114, 7946-7948 (1 992)), N 1 999 A2 from Streptomyces globisporus, 

(Yoshida, K., et al. Tetrahedron Lett, 34, 2637-2640 (1 993)), maduropeptin from 
Actinomadura madurea^ (Schroeden D.R., et al., J. Am. Chem. Soc, 116, 9351-9352 
(1994)); N1999A2 from Streptomyces sp. AJ9493, (Schroeder, D.R., et al., J. Am. Chem, 
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Soc\, 116, 9351-9352 (1994)): actinoxanthin from Actinomyces globisporus, (Khokhlov, 
A.S., et al., ,/ Antibiot., XXII 541-544 (1969)); iargomycin from Streptomyces 
pluricolorescens, (Yamaguchu T., et al., J. AniibioL:XXUl 369-372 (1970)); 
auromomyciri from Streptomyces macromoffty^ T. ? et ah, J. Antibiot., 

XXXll, 330-339 (1979)'), and sporamycin fromSthptosporangium pseudovutgare, 
(Komiyama, K^ et al., 1 Antibiot, XXX; 202-20'S (1977)) all -off Which are believed to 
possess a novel bicylo[7.3.0.]dodecadiynene chrombpHore core structure 1 ess for 
biological activity. In addition, with the exception of N1999A2, a required apoprotein 
acts as a stabilizer &nd specific carrier for thfc unstable chrdrriophore, and for its transport 
and interaction with target DNA. 

The non-chromophore enediyne subfamily is comprised of calicheamicin from 
Micromonospora echinospora spp. calichensis\ nafrienamicin from Polysyncraton 
lithostrotum; esperamicin from Actinoinddutaverrucdsospdra; and dynemicin from 
Micromonospora chersina. 1 -* - 

Enediyne antibiotics have potentiaFas Wntibancer agents because of their ability to 
cleave DNA, however, many of these compounds' are "too toxic to be used currently in 
clinical studies. Today, only calicheamicin is known to' be currently used in clinical trials 
and it has provided promising results as an anticancer agent. The enediynes potentially 
have utility as anti-infective agents, provided* that toxicity can be managed. 

The toxicity of the enediyne compounds; inciuding calicheamicin, centers on the 
problem of directing the compound to the cleave only the DNA of interest, such as tumor 
cell DNA, and not the DNA of the host. Due to calicheamicin's powerful ability to cleave 
DNA, scientists have investigated the mechanism by which calicheamicin-producing 
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organism protects itself against the DNA-cleaving activity of the molecule. Rothstein. D. 
M, Enediyne Antibiotics as Antitumor Agents, p. 77 (1995). Prior to this invention, 
knowledge of genes encoding for nan-chromoprotein enediyne self resistance was 
completely lacking. Insight into howMicrornonospora self resistance gene and gene 
products act to controlthe toxic effects of caHcheamicin offers new avenues of clinical 
research. For example, knowledge of the mechanisms underlying caHcheamicin 
resistance could provide the: means necessary to use higher doses of calicheamicin while 
simultaneously inhibiting. the tdxtc^ffects of the drug on. non-cancer cells. Additionally, 
understanding the mechanism behind calicheamicnrs self-resistance may aid in the 
understanding of self-resistance in. other enediyne antibiotics, thereby potentially making 
useful those. enediynes once thought to be too toxic to be viably used as therapeutic 
agents. The calicheamicin selftresistance mechanisms elucidated utilizing the present 
invention provide gene therapy approaches, for example, via introduction of enediynes 
resistance genes into bone marrow cells, thereby increasing resistance and allowing 
tolerance to chemotherajieutic'doses of calicheamicin.. Banerjee, D.» et al.. Stem Cells, 12, 
378-385 (1994). Thiis,; understanding calicheamicin self-resistance will significantly aid 
continuing clinical studies involving calicheamicin and the enediynes. The present 
invention addresses this need as it provides for the isolation and characterization of a 
resistance gene and its associated protein for any nonchromoprotein enediynes. 

Calicheamicin has two distinct structural regions: the aryltetrasaccharide and the 
aglycone (also known as the warhead). The aryltetrasaccharide displays a highly unusual 
series of glycosidic, thioester, and hydroxylamine linkages and serves to deliver the drug 
to specific tracts (S'-TCCT-S' and 5'-TTTT-3') within the minor groove of DNA. The 
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aglycone of calicheamicin consists of a highly functionalized 
bicyclo[7.3.1]tridecadiynene core structure with an allylic trisulfide serving as the 
triggering mechanism. McGahren. W.J.,et aL, Enediyne Antibiotics as Antitumor Agents, 
pp. 75-86 (J995); Once the aryhetrasaccharide is firmly docked, aromatization of the 
bicyclo[7.3/l]tridecadiynene core structure^ via f a l,4 L dehydrobenzene-*diradical, results in 
the site specific oxidative double' strand ; scission; of the targeted DNA, Zein, N.,et aL, 
Science, 240, 1 1 98-1 201(1988). The agly come undergoes a reaction that yields carbon- 
centered diradicials, which are responsible for DNA cleavage. This activity has sparked 
considerable interest in the pharmaceutical industry culminating in the recent success of 
calicheamicin-antibody conjugates (CMA-676) to treat acute myelogenous leukemia 
(AML) in phase III trials. Additionally, similar strategies have been used in phase I trials 
to treat breast cancer. A massive program to examine calicheamicin conjugated to 
alternative delivery systems has also recently been undertaken. Hamann, P.R., et al., 87th 
Annual Meeting of the American Association of.Cancer Research, Washington, D.C., pp. 
471 (1996); Hinman, L.M., et aL Cancer Res:, . 53\ 333.6 (1993); Hinman, L. M, et al., 
Enediyne Antibiotics as Antitumor Agents, 105 (1995); Sievers, E.L., et aL, Blood, 

93, 3678-3684 (1999); SiegeL M.M v et aL ? Anal. Chem., 69, 2716-2726 (1997); Ellestad, 
G. personal communication. . ..... 

The biological activity and molecular architecture of calicheamicin has also 
prompted a search for the potentially useful analogs. Gf the numerous laboratories 
producing synthetic analogs, one group has produced a novel calicheamicin d\ shown to 
effectively suppress growth and dissemination <of liver metastases in a syngeneic model of 
murine neuroblastoma. Lode, H. N., etaL> Cancer Res., 58, 2925-2928 (1998); Wrasidlo, 
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W., et dX^ Acta Oncologica, 34. 157-164 (1995). In addition to synthesizing 
calicheamicin analogs, random mutagenesis of M. echinospora and screening for mutant 
strains with improvedbjosynthetic potential has also been pursued. Rothstein, D. M., 
Enediyne Antibiotics, as Antitumor Agents, pp. 1 07-1 26 ( 1 995). 
5 The first total; synthesis,of calicheamicin was reported by Nicolaou and coworkers 

in 1,992. Synthesizing this complex antibiotic, though, presents ipany disadvantages. For 
example, Nacelle's procedure o,r\ly provides approximately a 0.007% yield and requires 
47 steps. Halcomb,. RX., Enediyne Antibiotics as Antitumor Agents, pp. 383-439 ( 1 995). 
Thus, the total synthesis of calicheamicin remains secondary to the isolation of 

10 calicheamicin from large fermentations of M. echinospora. Therefore, methods to 

produce mass amounts of calicheamicin and potentially useful variants are still needed, 
Fantini, A,, et aL. Enediyne Antibiotics as Antitumor Agents, pp. 29-48 ( 1 995). 
Transforming calicheamicin J>N A intp producing strains of bacteria, £. coli for example, 
would address this need. Currently there are no cloned M. echinospora genes and only a 

15 set of limited studies upon putative M. echinospora promoters are available. Lin, L.S., et 

aL, J. Gen. Microbiol, 138J8UAU5 (1992); Lin, L.S., et aL 1 BacterioL, 174,3111- 
3117 (1992): Baum, E.Z., et aL, X Bacteriol, 17L 6503-6510 (1989); Baum, E.Z., et al., 
J. BacterioL, 1 70, 7 1 -77 ( 1 9S8) ; . . 

Having calicheamicin DN A opens the door for genetic analysis of calicheamicin 

20 biosynthesis, as such analysis requires the ability to obtain large qualities of calicheamicin 

DNA. For example, one can study calicheamicin biosynthesis by mutagenesis of M 
echinospora, including the isolation and characterization of mutants blocked in 
calicheamicin biosynthesis and the subsequent analysis of their defective or partial 
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calicheamicin products. Additionally, particular enzyme could be overexpressed or 
underexpressed after subcloning its gene into a host such' as" E. coli: and the results of 
such overexpression studied to reveal the enzyme's function. Furthermore, the cloning of 
biosynthetic genes can ultimately result in increased yields of the gene product by cloning 
5 and expressing the biosynthetic gene encodihg^he rate-limiting enzyme back into the 

producing organism. If may also be possible to generate novel products by cloning 
biosynthetic genes into strains that make related cbmpounds/ Such genes could endow 
the host organism with the ability to carry out nfeiv reactions on the enediyne nucleus, and 
thus produce novel dfiigs. "* ' 

10 Calicheamicin ? s molecular architecture in conjunction with its useful biological 

activity and potential therapeutic value brand calicheamicin an target for the study of 
natural product biosynthesis. While the radical-based mechanism of oxidative DNA 
cleavage by calicheamicin (i.e. aromatizatibn of the bicyclo[7.3. 1 ]tridecadiynene core 
structure, via a 1 ,4-dehydrobenzene-difadical; resulting in the site specific oxidative 
15 double strand DNA cleavage) is well understood; it was Unknown, prior to this invention, 

how Micromonospora constructs calicheariiiciri. As a result, there is a need to discover 
and understand calicheamicin biosynthesis. Prior to this discovery, knowledge of genes 
encoding for nonchromoprotein enediyne biosyiithesis was completely lacking. Thus, 
this invention relates to the first identification, isolation, and cloning of a 
20 nonchromoprotein enediyne biosynthetic gene Cluster and mapping and nucleotide 

sequence analysis of the g£nes within the cluster . The invention provides the entire 
calicheamicin-biosynthetic cluster and biochemical studies of aryltetrasaccharide 
biosynthesis. Furthermore, the calicheamicin self-resistance gene and protein have been 
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isolated as have the genes and resulting enzymes for steps within the calicheamicin 
cascade. The invention also provides for construction of enediyne overproducing strains, 
for rational biosynthetic modification. of bioactive secondary metabolites, for new drug 
leads* and for an enediyne combinatorial biosynthesis program. 

The present invention, thus, also relates to a biosynthetic modification of bioactive 
secondary metabolites through enediyne combinatorial biosynthesis. As most 
pharmaceutical drug leads are inspired by naturally occurring compounds, and given the 
challenge posed in synthesizing these metabolites, genetic manipulation of the sugar 
appendage on the metabolites offers avenues for creating potential new drugs. Thus the 
emerging field of combinatorial- biosynthesis has become a rich new source for modified 
non-natural sugar scaffolds. Marsden, A., et aL Science 1998, 279, 199-201. Problems 
inherent with the genetic manipulation of the sugar appendage relate to the fact that 
naturally occurring bioactive secondary metabolites possess unusual carbohydrate 
ligands, which serve as molecular recognition elements critical for biological activity. 
Macrolide Antibiotics \\Ghemistryr Biology and Practice, 1984. Without these essential 
sugar attachments, the biological activities of most clinically important secondary 
metabolites are either completely abolished or dramatically decreased. Currently, 
techniques for the genetic manipulation of the sugar appendage for a given metabolite 
rely mainly on the, alteration and/or deletion of a small subset of genes required to 
construct and attach each desired sugar moiety. Thus there is a need to develop alternate 
strategies to construct and attach non-naturally occurring sugars. The present invention 
addresses this need. The present invention utilizes the fact that glycosyltransferases, 
which are responsible for the final glycosylation of certain secondary metabolites, show a 

8 
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high degree of promiscuity toward the nucleotide sugar donor. Zhao, L., et ah, J. Am. 
Chem, Soc. 1988, 120, 12159-12160. This liriselectivity of the glycosyltransferases has 
the potential for allowing modification of the crucial glycosylation pattern of natural, or 
non-naturaL secondary metabolite scaffolds in a combinatorial fashion. The present 
invention discloses a method using the recmitmerit and collaborative action of sugar 
genes from a variety of biosynthetic pathways to construct composite gene clusters, which 
make and attach non-natural sugars. 

Summary of the Invention 

The present invention provides an isolated nucleic acid molecule from 
Micromonospora echinospora encoding for a gene from a nonchromoprotein enediyne 
biosynthetic gene cluster, the protein coding region of the gene or a biologically active 
fragment of the gene. In particular, the present-in vention provides an isolated nucleic acid 
molecule, gene, or gene cluster from MicromonosporU echinospora spp/ calichensis that 
is involved in the biosynthesis of calicheamicin. In another embodiment, the present 
invention also relates to nucleic acids capable of hybridizing with a nucleic acid molecule 
from Micromonospora echinospora spp. calichensis coding for one or more genes from a 
nonchromoprotein enediyne biosynthetic getieciuster In a further embodiment the 
invention provides an expression vectdr cdm'pri£irig an isolated nucleic acid molecule 
from a nonchromoprotein enediyne biosynthetic gene cluster from Micromonospora 
echinospora. In yet a further embodiment the invention-provides a cdsmid comprising an 
isolated nucleic acid molecule from Micromonospora echinospora comprising a nucleic 
acid sequence encoding for a nonchromoprotein enediyne biosynthetic gene cluster. 
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In preferred embodiments, the invention provides the isolated nucleic acid 
molecules of SEQ ID Nos. 1 , 3. and 5. 

In an additional embodiment, the present invention provides a host cell 
transformed with an isolated nucleic acid molecule from a nonchromoprotein enediyne 
biosynthetic gene cluster from Micromonospora echinospora. Host cells can optionally 
be of bacterial, yeast, fungal, insect, plant or mammalian origin and can be transformed 
according to standard methods. In a preferred embodiment, the host cell is the bacterium 
£. coli or Streptomyces, In a further embodiment, the invention is directed to a 
transformed host cell with an expression vector encoding gene calC. or a functional 
derivative thereof operably linked to regulatory sequences that enable expression of ca/C. 
In a yet further embodiment, the invention provides a transformed host cell with an 
expression vector encoding the gene calH, or a functional derivative thereof operably 
linked to regulatory sequences that enable expression of calH. Likewise, the invention 
provides a transformed host cell with an expression vector encoding the gene calG, or a 
functional derivative thereof, operably linked to regulatory sequences that enable 
expression of calG. 

The invention further provides a method of expressing a protein by culturing a 
host cell transformed with an expression vector comprising an isolated nucleic acid 
molecule from Micromonospora echinospora encoding for a gene from a 
nonchromoprotein enediyne biosynthetic gene cluster, and incubating the host cell for a 
time and under conditions allowing for protein expression. In another embodiment the 
invention provides a method of purifying calicheamicin using affinity chromatography. A 
sample containing calicheamicin is contacted with an affinity matrix having the protein 

10 
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CalC bound thereto, for a time and under conditions allowing calicheamicin to bind to the 
matrix, eluting calicheamicin from the matrix, and recovering calicheamicin. 

In a further embodiment the present invention provides polypeptides having the 
amino acid sequences of SEQ ID Nos. 2, 4, and 6. 

In yet a further the invention provides the production of the following two new 
macrolides: 
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The invention further provides a method of conferring calicheamicin resistance to 
a subject comprising obtaining cells from the subject, transforming the cells with the 
calicheamicin self-resistance gene, and returning the cells to the subject. Alternatively, 
the calicheamicin self-resistance gene can be targeted and delivered to the desired host 
cells through known gene therapy delivery systems. 



20 



Brief Description of the Figures 

Figure 1 depicts the summary of the cosmid clones isolated from M echinospora 
genomic library. This figure illustrates the results of the screening of the genomic library 
for clones carrying the calicheamicin bio^ynthetic cluster. 

Figure 2 shows a restriction map of a portion of cosmid clones 4b, 13a, and 56 and 
the corresponding location of cal genes from M. echinospora. 
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Figure 3 is a table of the open reading frames ("orfs") in the calicheamicin 
biosynthetic cluster. This table lists the polypeptides that the genes encode for as well as 
their proposed or actual determined function in the biosynthetic pathway. 

Figure 4 is a graph of the UV-visible absorption spectra of purified mbp-CalC. 
The purified mpb-CalC was analyzed j n the following solution: 52 pM mpb-CalC; 1 0 
mM Tris-HCk pH 7.5). The inset shows the results of low temperature (4.3 K) the X- 
band EPR analysis of CalC. 250 pM mpb-CalC containing 0.5 mol Fe per mol CalC was 
analyzed in 1 0 mM Tris-HCL pH 7.5, The spectrometer settings were as follows: field 
set = 2050 G; scan range = 4,000G; time constant = 82 s; modulation amplitude =16 G; 
microwave power - 31 pW; frequency = 9.71 Ghz; gain = 1000; determined spin 
quantitation = 90 ± 10 pM Fe. 

Figure 4(b) provides the results of the mbp-CalC in vitro assay. 

Figure 5 depicts the postulated routes for the biosynthesis of required nucleotide 
sugars. The enzymes are depicted as follows: E deox = deoxygenase; E am = 
aminotransferase; E ep = epimerase; E mef = methyltransferase; E od = 4,6-dehydratase; E ox = 
oxidase; E p = nucleotidyltransferase; E r e d = reductase; E sh = sulfhydrytransferase. 

Figure 6 illustrates a schematic representation of the in vivo production of 
pikromycin/methymycin-calicheamicin hybrid metabolites. 

Figure 7 depicts the Streptomyces Venezuela methymycin/pikromycin gene cluster. 
Eight open reading frames (des\-desy\\\) in this cluster have been assigned as genes 
involved in desoamine biosynthesis. This figure also depicts the hybrid pathway toward 
new methymycin/pikromycin derivatives (1 1 and 12) produced after heterologous 
expression of the calH gene of calicheamicin in a S. Venezuela mutant. 
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Figure 8 illustrates calicheamicin's (6) four unique sugars which are crucial to 
tight DNA binding. Sugar (9) is derived from 4 : amino-4,6-dideoxyglucose (8) and is part 
of the restricted N-0 connection between sugars A and B. Compound 8 is derived from 
the corresponding 4-ketosugar (7) via a transamination reaction. The gene calH encodes 
5 the desired C-4 aminotransferase allowing conversion of compound (7) to compound (8). 

Detailed Description ot the Invention 

The preterit 'invention is directed to' the isolation and characterization of the 
calicheamicin btosynthetic cluster. This cluster encodes the genes that encode the 

1 0 proteins and enzymes that are involved in the deoxysugar synthesis (the 

aryltetrasaccharide), polyketide biosynthesis (the aglycdne) of calicheamicin synthesis, 
and calicheamicin resistance. Twenty-one structural genes have been identified that 
encode for the aryltetrasaccharide sugar ligatids (^20 kb); approximately eight modules 
(-40 kb) are required for the 15-cairboh aglycone^ Four proteinslnvolved in transport and 

15 uptake, one protein conferring resistance; and b c he regulatory protein have been identified. 

The calicheamicin biosynthetic gene cluster comprises the following genes: cal A, 
calB, ca/C, calD, calE, calF, calG. calti, dMl catt, calK, cafL, caM, ca/N, calO, cal?, 
calQ, caIR, calS, calf, orfi, or/2, drfh\ 6t$, difi, oiffi, or/7, and an IS-element gene. 
The above listed genes encode for the following polypeptides: CalA (328 amino acids), 

2 0 CalB (56 1 amino acids), CalC (181 miiid acids); CalD (263 amino acids), CalE (420 

amino acids), CalF (245 amino acids), CaiG (990 amino acids), CalH (338 amino acids), 
Call (568 amino acids), CalJ (332 amino acMs), CalK (440 amino acids), Cal L (562 
amino acids), Cal M (416 amino acids), CalN (39^ amiha acids), CalO (331 amino acids), 
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Cal P (approximately 179 amino acids). CaiQ (453 amino acids).. CaIR (265 amino acids). 
CalS (1113 amino acids), Call (.280 amino acids). Orf 1(322 amino acids), OrO (654 
amino acids). Ocf3 (209 amino acids), Orf4 (52 1 amino acids), Orf5 (1 75 amino acids). 
Orf6 (139 ajnino acids), Orf7 (187 amino acids), and IS-eiement (402 amino acids). 

In elucidating the caHcheamicin biosynthetic gene cluster, the. inventors began 
with a genomic library containing the genome of Micromonospora echinospora spp. 
calichensis. The cosmic library.was generated by isolating chromosomal DNA of 
Micromonospora echinospora zpp, /:aiichen$i&\ fragmenting that chromosomal DNA. 
inserting the DNA into a cosraid vector and generating acosmid library according to 
methods well known in the art. This procedure can be performed in any species of 
Micromonospora. 

Based upon prior enediyne metabolic labeling studies it was postulated that the 
calicheamicin aglycone would be polyketide derived. Polyketide metabolites encompass 
a vast variety of structural diversities yet share a common mechanism of biosynthesis. 
Hutchinson, CR.,.et a!.-, Ckem..fiev.,..97 t - 2525-2535 (1997); Strohl, W.R., et al, 
Biotechnology* of Antibiotics pp. 577-657; Fujii, L, et Chem. Rev.. 97, 2511-2523 
(1997); Hopwood, D.A., et al., Chem Rev. f 97, 2465-2497 (1997); Hopwood, D.A., et aL 
Ann. Rev. Genet, 24, 37-66 (1990); Staunton, J., et al., Chemical Reviews, 97, 261 1-2629 
(1997). Most important, polyketide synthase ("PKS") genes display a high degree of 
sequence homology (from pathway to pathway and organism to organism) and are often 
clustered with genes encoding self resistance and deoxysugar ligand biosynthesis. 
Hopwood, D.A., et al., Chem. Rev., 97, 2465-2497 (1997); Hopwood, D.A., et al., Ann. 
Rev. Genet, 24, 37-66 (1990); Staunton, J., et al., Chem. Rev., 97, 261 1-2629 (1997). 
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Degenerate primers based upon conserved regions within PKS genes were used in 
Southern hybridizations to identify clones from the M. echinospora genomic library that 
carried putative PKS genes. The Southern hybridizations werb performed by methods 
known in the art: Southern hybridization of the gerioniic M echinospora cosmid library 
with a DNA probe designed to target type i PKS ^dne's^KS r ) " (Kakavas, S.J., et al.,7. 
Bacteriol- 1 79, 751 5-7522 (1 997)): Uriv£ile<3 fWH "positive clones; which were designated 
clones 4b; .10a;-l'3aV 56; and 60. Se^'Figure 1 " fc'OTHfsSriie' Ti ve- ; clb ! nes Were also identified 
aipori rescreeriin^ 1. 
Although this prelim iriary.&fel^ presence of Micromonospora 

PKS gene homologues, a secondary screen was 1 performed as PKS hybridization analyses 
are often plagued by false hybridization to gene clusters that encode spore pigment 
biosynthesis. -° " :: ; " ; * " 

The second screening was based on' the assumption that calicheamicin's 
biosynthetic cluster would also contain genes ^coding fot deoxy^ ligand synthesis. 
Further, it was postulated that all hexopyranoSyl iig&ftds of calictfieamicin diverged from 
the common intermediate 4-keto-6-deoxy TDP^D-gluc6se (30), Figure 5, as 
macromolecule-sugar synthesis in many organisms began with a similar common 
intermediate. Thus, it Was believed that the cluster encoding for calicheamicin 
biosynthesis should, in addition to carrying a FKS-encoding region, would carry both a 
common glucose-l-phosphate nucleotidyltransferase and a NDP-a-D-glucose 4,6- 
dehydratase gene, encoding the putative enzymes E pl , and E od , respectively. See figure 5. 
These enzymes are necessary to convert a sugar (12)(figure 5) to the hypothesized 
common intermediate, 4-keto-6-deoxy TDP-D-glucose (30). Analogs to 4,6-dehydratases 
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have been previously characterized from E. coli. Salmonella, and Sireptomyces. 
Additionally, a nucleotide transferase from Salmonella has been characterized as an 
alpha-D-glucose-1 -phosphate thymidylyltransferase/ The secondary screen was 
performed using a probe based upon the postulation that the M. echinospora's 
calicheamiein synthesis would begin from a similar precursor found in E. coli, 
Streptomyces and SalmoneUa^rA that this precursor required a dehydratase to convert it 
into the common intermediate 4-keto-6-deoxy TDP-D- glucose (30). In particular, a 
DNA probe (designated E ocJ ! ) wasvdesigned from the conserved NAD'^-binding site of 
bacterial NDP-a-D-glucose.4:6-.dehydratases. He. X., ci al., Biochem.. 35, 4721-4731 
(1996). Southern hybridization of. the genomic M echinospora cosmid library with the 
Eod probe revealed cross-hybridization with clones 4b, 1 0a, 1 3a, 56. and 60. Two 
additional clones, designated 58 and 66 v were also identified in this screen. See Figure 1. 
This secondary hybridization indicated the clustering of genes encoding both polyketide 
and deoxy sugar biosynthesis, r 

For final corroboration, since.secondary metabolite biosynthesis is typically 
clustered with resistance genes in actinomyeetes, all hybridization-positive clones were 
tested for their ability to grow in the presence of varying concentrations of calicheamiein. 
In this final screen, six of the seven hybridizing clones displayed differing levels of 
resistance to calicheamiein (4b ~ 10a~ 13a>56^66>60)(See Figure 1) while clone 58 
lacked the ability to grow in the presence of calicheamiein. In addition, these resistance 
screens revealed that clones 4b, 10a, 13a conferred much higher levels of resistance to 
calicheamiein than the other clones. Upon rescreening the genomic library for 
calicheamicin-resistant clones, three additional clones (3a, 4a, and 16a) were found to 
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confer similar levels of resistance. Cumulatively, the results demonstrated that clones 4b, 
10a. 13a. 56, and 60 carried PICS ■ I and II homologues and deoxy sugar biosynthetic 
genes, as well as encoded the gene responsible for conferring calicheamicin-self 
resistance. - >. :.. - />. >• : . ;■. ':■*- 

The clones positive for PKS I and II andidcoxy sugar biosynthesis homology and 
calicheamicin resistance were used to map the 'biosynthetic cluster. Southern 
hybridization established similarity between cle«ies?Bay 4a. 4b, 10a, 13a. 16a and 56. In 
addition, nucleotide sequence bverlaps were found between !clqne£:4b: 1 3a. 'and 56. See 
Figure 1 . Restriction, mapping and Southern hybridization of these clones indicated that 
the positive cosmid clones corresponded to a continuous region of the M. echinospora 
chromosome spanning > 1 00 kb. The present invention thus provides for cosmids having 
a nucleic acid molecule from Micromonospora echinospora encoding for a 
nonchromoprotein enediyne; biosynthetic cluster. ' 'M.r 

After isolating the biosynthetic gene cluster and elucidating the sequence, open 
reading frames ; ("orfs") were assigned. Tentative gene assignments were derived from 
amino acid sequence ..similarity of translated orfs to, gene products of known function via 
direct BLAST (Basic Local Alignment Search Tool) database searches on the amino acid 
level, -Karlin^ et al M Proceed Nath Acad Sep, MS,A± 87, 2264-2268 (1990); Karl in, et al., 
Proceed Natl. Acad Set, U.&A.r90. ,5873-58771(1993); A\Xc\m\rNature Genet:, 6, 119- 
129(1994). The ge;ne cluster organization isprovided in figure 1. 

Based on BI^AST analysis tentative gene- assignments were made. It was 
deducted that genes participating in the construction of the aryltetrasaccharide include: a) 
genes encoding nucleotide sugar biosynthesis (calG, H* K, O, Q, and S), b) genes 
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encoding for aryltetrasaccharide assembly (calE and N); and c) genes encoding for 
''tailoring" reactions (calD. F. and J). 

One aspect of the invention relates to transformation of a host cell with M. 
echinospora DNA. This method provides a reproducible transformation efficiency of 
5 -10 3 kanamycin resistant tran^forrnants/^g DNA using a pKCl 139-based vector. The 

invention further provides that the host cell can be but is not limited to bacteria; yeast, 
fungus, insect, plant or marnrnalian. ( Transformations of bacteria, yeast, fungus, insect, 
plant or mammalian cells are performed by methods -known in the art. 

The present invention also provides the isolation and characterization of the gene 
10 encoding for calicheamicin resistance. One aspect of the invention relates to an isolated 

DNA strand having the gene ra/Cand having the DNA sequence SEQ. ID No.: 1 . The 
present invention also relates to an isolated protein CalC, having the amino acid sequence, 
SEQ ID. NO. 2. The invention further provides for calC gene fragments coding for a 
bioactive CalC. The polypeptide, CalC, confers calicheamicin resistance and has 1 81 
15 amino acids. The invention also provides for CalC fragments conferring calicheamicin 

resistance. , . • . 

The coIC locus was isolated by identifying calicheamicin genomic cosmid clones 
that were able to grow on luria bertani {''LB") agar plates containing ampicillin.and 
calicheamicin. The DNA of the positive clones (clones that grew on the plates containing 
2 0 calicheamicin) was isolated and subsequent restriction mapping localized the desired 

phenotype (calicheamicin resistance). The DNA was then sequenced and the open 
reading frames analyzed to ascertain the orf encoding for the desired phenotype. In vitro 
studies were also performed and confirmed the ability of CalC to inhibit DNA cleavage. 
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DNA containing ca/C was cloned into an inducible vector/using known methods, 
resulting in overexpression of calC. The polypeptide product (CalC) was then isolated 
and purified to homogeneity. Analysis of the purified CalC revealed that it is a non-heme 
iron metal loprote in that functions via inhibitibn-'of calicheamicin-induced DNA cleavage 
in vitro. Another aspect of the invention is arTtek^^ calC or a 

fragment of calC encoding for a bioacti ve ntbl^tule: r There* is also provided a transformed 
host cell, preferably bacteria; more preferably ^E^^W^^hta^irigcalC or a fragment of 
calC encoding- for a bioaetive r m6\eculb. : ^ if 1 t; - 

The present invention provides for tfle traWsformatibn of human cells with the 
calC gene. This allows bone marrow cells, for example, to be removed from a patient 
being treated with calicheamicin. and to transform these cells with calC, and return the 
transformed cells to the patient. This allows the patient to tolerate treatment with 
calicheamicin or allows the patient to receive' higher doses of calicheamicin as the 
returned human-cr//C-transfonrie^ The 
transformation is performed by methods known iti the art: The embodiment of the 
invention would be applicable to many diseases being treated with calicheamicin. 

Another aspect of the invention relates t6 att isolated DNA strand containing the 
ca/H gene having the DNA sequence SEQ ID; No: 3* The invention also relates to the 
polypeptide CalH, having amino acid sequence SEQ ID. No. 4. The invention further 
provides for calH gene fragments coding for a bioactive CalH. CalH is involved in the 
formation of the aryltetrasaccharide 4,6-dideoxy-4-hydrcxyJamino-D-glucose moiety. 
CalH catalyzes the conversion of intermediate (30) to intermediate (39) (figure 5). CalH is 
a TDP-6-deoxy-D-glycerol-L-threo-4-hexulose 4-transarninase, which catalyzes a 
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: pyridoxal phosphate ("PLP")-dependent transamination from glutamate to provide 4- 
amino-6rdeoxy TDP-D glucose\(interrnediate 39)(figure 5). The invention also provides 
for CalH fragments that retain bioactivity. There is also provided an expression vector 
containing the calH gene or fragments of the calW gene that encode for a bioactive 
5 polypeptide. CalH were overexposed as a (histidine), 0 -fusion protein and subsequently 

purified by nickel affinity chromatography. 

According; to BLAST ana r ysis.,cr//H closely. resembled, perosamine synthase, an 
enzyme which converts. compound ,30 to compound 39 (See figure 5) en route to the 
biosynthesis of TDP-perosaniine (TDP-4.6-dideoxyT4-amino-P-mannose) in E. coll 
10 Wang, L.. et al;. Infect. Immunol.. 66,, 3545-3551 (1998). Thus CalH was believed to be a 

4-ketohexose aminotransferase. To confirm the tentative BLAST assigned function, a 
combinatorial biosynthesis was performed. Specifically the calH gene from calicheamicin 
was incorporated into a mutant : . strain of Streptomyces Venezuela. The 4-dehydrase gene 
(des\ ) in the methymycin/pikromycin pathway: was deleted in this mutant strain. A 
15 promoter sequence from the S. Venezuela methymycin/pikromycin cluster was 

incorporated in the expression vector to drive the expression of foreign genes (the calH of 
calicheamicin) in S. Venezuela. In -wild type S, Venezuela methymycin/pikromycin 
pathway is known to produce methymycin. neomethymycin, pikromycin, and narbomycin. 
Set figure 6. Deletion of the desl gene in the mutant strain led to the accumulation of the 
2 0 CalH substrate, TDP-4-keto-6-depxyglucose (compound 30, figure 6). The constructed 

expression vector with the S. Venezuela promoter expressed the calH gene to make the 
CalH protein. CalH acted on the substrate, 30, to produce compound 39 (figure 6). 
Compound 39 in turn, with the action of S. Venezuela's DesVII (a glycosyltransferase) 
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produced two methymyciri/pikromycin-caiicheamicin hybrid compounds. See Figure 6. 
compounds 40 and 4 1 . These hybrid compounds carry the 4-amiriohexose ligand of 
calicheamicin. This work provides indisputable support for the ca/H gene assignment as 
encoding the TDP-6-deoxy -D-glycero-L-threo-4-hexulose 4 1 aminotransferase of the 
calicheamicin pathway. The CalH acted on ttte:TDP-4-keto-deoxy glucose substrate 
(compound 30) to produce compound 39- (Figure S). ; ■ 

In addition^ these results reinforce the indiscriminate -nature of the corresponding 
glycosyltransferase (DesVII) as it reveals that the glycosyltraftsferase (DesVII) of the S. 
Venezuela pathway can recognize alternative sugar substrates whose structures are 
considerably different from the original amino sugar substrate, TDP-D-desoamine. The 
results also clearly demonstrate the ability to engineer secondary metabolite glycosylation 
through a rational selection of gene combinations - The successful expression of the CalH 
protein in S. Venezuela by the newly constructed expression vector highlights the potential 
of using this system to express other foreign g&nes in this* strain; 

Thus, one aspect of the present invention further relates to the construction of a 
composite gene cluster having the ability : to ( make arid attach non-natural sugars. The 
invention further provides ah expression vector having a calicheamicin gene operably 
linked to regulatory sequences to control expression of the calicheamicin protein and 
preferably the regulatory sequence is z Streptontyces promoter. The present invention also 
relates to two newly synthesized sugarsv compound (11) and compound (12)(figure 7). 
Compound 1 1 has the formula: : - ' £ .m* • 
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The spectral data of compound 1 1 was as follows: 

'H NMR (500 MHz CDC1 3 . J jn hertz) 5 6.75 (III: dd. J = 16.0. 5.5, 9-H) 6.44 (1H. 
dd,J = 16.0. 1.2,8-H),5.3,4 (lH. ; d,j = 8.0,N-H), 4.96 (lH.m. 1 1-H). 4.27 (IH, d. J=7.5, 
5 1 -H), 3.66 ( 1 H, dd, J 9.5, : 8.p.;4; ? H), 3.60 .( 1 H, d, J = 1 0.5. 3-H). 3 .50 ( 1 H, 1 , J - 9.5. 

3'H), 3. d ( 1 H, m, 5'-H), 3 .4 (1 H,,m. 2--H), 2.84 (IH, dq, J = 10.5, 7.5. 2-H), 2.64 (IH, m. 
10-H), 2.53 (IH, m, 6-H), 2.06,(3^.5. Me-C=0), J .7 (lH,.m, 12-H). ! .66 (I H, m, 5-H). 
1.56 (IH, m. 12-H), 1.4 (IH, M. 5-IJ). 1 .36, (3H, d.l J-7.-5.:2-Me). 1.25 (31 1. D, J = 6.5. 5'- 
Me), 1.24 (IH, m. 4-H), 1.21 (3H. d. J=7 : 5. 6 Me), i.10 (3H, d: J=6.5. 10-Me). 0.99 (3H. 

10 d, J=6,0, 4-Me), 0.91 (3H, t, J =7.2, 12-Me); ,3 C NMR (125 MHz. CDC! 3 ) 6 205.3 (C-7), 

175.1 (C-l), 171.9 (Me-C-O). 147.1 (C-9), 126.1 (C-8), 103.0 (C-l"). 85.8 (C-3), 75.8 (C- 
5'), 75.8 (C-3 1 ). 74. 1 (C-l 1) 70.8 (C-2 1 ), 57.6 (C-4'), 45.3 (C-6). 44.0 (C-2), 38.1 (C-10), 
34-2 (C-5), 33.6 (C-4), 25.4 (C-l 2), 23.7 (Me-C-O), 18.1 (C-6'), 17.9 (6 Me), 17.6 (4-Me), 
16.4 (2-Me), 10.5 (12-Me), 9.8, ( 10 Me). High-resolution FAB-MS calculated for C, 5 H 42 - 

15 NO g (M + H*) 484.2910, found 4F4.2303. . 

Compound 1 2 has the formula: 



o 
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The spectral data of compound 12 w as as follows; 

'H NMR(500 MHz, CDC1 J in hertz) 6 6.69 ( 1 H. dd. J = 16.0. 6.0. 11-H), 6.09 
(lH;dd, J = 16.0. 1.5. 10-H), 5.35 (IB. di J = 8.51 N-H). 4.96 (1H. m, 13-H). 4.36 (1H. d, 
J = 7.5. l'H). 4.19 (1H, m. 5-H). 3.83 (lH-q, J=6;5, 2-H); 3.68 (1H, dt, J-10.0, 8.5, 4'H), 
3.52 (1H, t, J ■= 8.5, 3-*H), 3,50 ( 1 H. m. 54i): 3 42 ; (1 Hi C J - 7.5, 2'-H)V2.92 (1H, dq, J = 
7:0, 5.0, 4-HV2.81 (1H, rn. 8-H). 2.73 : (1H. ty J=7.X-2 f ^H). 2.0^ (3H. a, Me^C-O), 1.8 (IH, 
m, 6-H), i\6 (1H; nil, 14-H), 1 .55 ( i H, rri; '7-H)l^ f 37 (3 H, d. J •= 6.5. 2-Me). 1.32 (3H, d, 
J = 7.0, 4-Me), 1.3 (IRrri; H-l4)^K27 (3hJdH -6.5/5'-Me). 1.25 (1H, m. 7-H). 1.12 
(3R d r J =6.0, 8-Me)Ji.Il (3H, d, J = 6& T2-Me% 1 .07 (3R d. J = 6.0. 6-Me), 0.91(3H. 
1, J -7.2, 1 + Me); high resolution FAB MS calculated for C n H 46 NO, (M+H~) 
540.3 172.found 540.3203. 

One aspect of the invention relates to an isolated DNA strand containing the calG 
gene and having the DNA sequence SEQ ID; NO.: 5. Another aspect of the invention is 
the protein, CalG, having amino acid sequence SEQ ID. No:: 6. Based on BLAST analysis 
it was presumed that calG encoded a 4.6-dehydratase; Dehydratases had been 
characterized from E. coli, Salmonella and Streptomyces, (Thompson, M. et al., J. Gen. 
Microbiol, 138, 779-786 (1992); Vara, J.A., et al.;./ Biol. Chenu 263, 14992-14995 
(1988)), and analogous NDP-D-glucose 4,6-dehydratases had been characterized from a 
variety of organisms. Liu, H.-w. ? et a\.,Ann. Rev. Microbiol., 48, 223-256 (1994): Hallis, 
T.M., et al., Acc. Chem. Res., in press (1999). Based upon these prior studies, it was 
known that the overall transformation catalyzed by 4,6-dehydratases is an intramolecular 
oxidation-reduction where an enzyme-bound NAD^ receives the 4-H as a hydride in the 
oxidative half-reaction and passes the reducing equivalents to C-6 of the dehydration 
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product in the reductive half-reaction. Thus, it appears that Cal G is. necessary for the 
formation of the aryltetrasaccharide 4.6-dideoxy-4-hydroxylaniino-D-glucose moiety. 
CalG appears to be a TDP-D-giucose 4,6-dehydratase which catalyzes the conversion of 
intermediate 13 into intermediate 30. (See figure 5). Another aspect of the invention is an 
5 expression vector containing. calG or a fragment of ealG encoding for a bioactive 

molecule. There is also provided a transformed host cell, preferably bacteria, more 
preferably, E. ^//.containing eit!G or a fragment of calG encoding for a bioactive 
molecule. : : / * ,-■ c ■ <. - \ 

There is also disclosed an isolated DN A strand containing the calS gene. Based on 

1 o sequence homology with other P450-oxidases, CalS appears to be a P450oxidase 

homolog which performs the oxidation of intermediate 39 to intermediate 42 (figure 5). 
The oxidation may occur at the nucleotide sugar level or hydroxylamine formation after 
the sugar has been transferred to the aglycone. There is also provided an expression vector 
containing the calS gene or a fragment otcalS encoding for a bioactive molecule. There is 
15 also provided a transformed host celL preferably bacteria, more preferably, E. coli, 

containing calG or a fragment of calG encoding for a bioactive molecule. 

The present invention allows genetic manipulation of the biosynthetic gene cluster 
to produce calicheamicin analogs. The present invention provides for producing 
calicheamicin analogs by constructing deletions or substitutions of the genes involved in 

2 0 biosynthesis of the aryltetrasaccharide. The invention further provides for in vitro 

glycosylation by altering the glycosylation pattern of calicheamicin (via a 
glycosyltransferase) to produce additional analogs. The invention also provides for 
alteration of the calicheamicin aglycone by genetic manipulation of the genes encoding the 
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biosynthesis of the warhead. Genetic manipulation, such as producing deletions or 
substitutions are performed using methods known in: the art. 

The invention provides for a method of purifying ealicheamicin through affinity 
chromatography. Ca/G, because of its homology with ealicheamicin functions as a 
5 calicheamiciri-sequestering/binding protein. ; Affinity, chromatography is performed using 

methods known in the art. > : : ^ i ; . ■ ; ; , 

The invention relates.to. the expressioa of the genes ; located in thebidsynthetic gene 
cluster by using methods known in the art to insert the genes into a suitable expression 
vector and operably linking the gene to regulatory sequences to control expression of the 
10 gene to produce the protein encoded by the inserted gene. The present invention also 

provides for expression of biologically active proteins by inserting fragments of genes 
selected from the biosynthetic gene cluster, which encode for biologically active proteins, 
into a suitable expression vector, using methods known in the art. The genes would be 
operably linked to regulatory sequences to control their expression. 
15 ■■ .. - • . , .a , ;>• . • ; -, ■ , ' ' : . : 

EXAMPLES 

Example 1 ■ : - 

To rapidly elucidate the nucleotide sequence; thermocycle sequencing was 
accomplished from pUC- ; or pBluescript-based subclones {using Ml 3 primers and primer 
20 walking) as well as directly from isolated cosmids (via primer walking). Nucleotide 

sequence data was acquired using two Applied Biosystems automated 310 genetic 
analyzers and sequences were subsequently assembled: using the Applied Biosystems 
AutoAssembler™ DNA sequence assembly software. ..Dear, S^et ah, Nucl Acids Res., 14, 
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3907-3911 (1991); Huang, X.. Genomics. 14. 18-25 (1992). Orf assignments were 
accomplished using a combination of the computational programs MacVector™ 6.0 and 
Brujene. MacVector is a commercially available softvyare package which provides the 
ability to construct a Micromonospora codon bias table (from known Micromonospora 
sequences) and subsequently use this codon bias table to search for optimal orfs. Fickett, 
J.W., Nucleic Acids Research, 10, 5303-5318 (1982). Alternatively, the shareware 
program Brujene was specifically designed for streptomycetes and assigns priority to orfs 
that illustrate a consistency high G/C% in the wobble position. 

Example 2: Isolating and Characterizing calC 

To isolate the gene(s) responsible for calicheamicin resistance in Micromonospora* 
clones conferring calicheamicin resistance were selected by growth of a Micromonospora 
genomic Afunctional cosmid library on LB plates containing ampicillin (50 |ig ml" 1 ) and 
calicheamicin (0.25 jug ml' 1 ). In this selection, six clones (3a, 4a, 4b, 10a. 13a and 16a) 
displayed resistance to caiicheamicin. Restriction mapping of these clones localized the 
desired phenotype to a ~2kb Pstl-Sacl fragment of DNA. (Figure 2). Maximum tolerated 
concentrations of calicheamicin on the LB plates was ascertained. The results are as 
follows: 



Cosmid or Plasmid 


Maximum tolerated concentration of 
calicheamicin 


cosmids 3a, 4a, 10a, 13a, and 16a 


0.5 |ig ml' 1 


pJT1214and pJT1232 


5.0 \xg ml 1 


pRE7 


20.0 jag ml"' 
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induced pRE7 


50.0 ugmr 1 


pJT 1 224 9 pAP6. Pre ] , and control 
plasmids pUC18, pBiuescript. and pMAL- 

C2 ' ■ 


<0.01 \igm\ l 



Nucleotide sequence analysis. of thz Pst\-Sac\ fragment suggested that it contained 
four possible orfs. The proximal 1 kb of this fragment carried a single orf (calD). The 
distal 1 kb presented three overlapping orf candidates (calC. ca!C\ and ca!C"). Computer 
translation of these three orfs (calC. calC\ and calC") was performed and subsequent 
BLAST analysis of their corresponding proteins. CalC. CalC. and CalC". respectively, 
revealed no homology with known proteins, while the translation of gene calC displayed a 
weak alignment with apoproteins of the chromoprotein enediynes. Translation of calD to 
its respective protein, CalD, revealed the presence of three amino acid motifs typically 
conserved in S-adenosylmethionein-utilizing 0-methyltransferases. Therefore it was 
hypothesized that colD was not responsible for calicheamicin resistance. To rule out calD 
as being responsible for calicheamicin resistance, a subclone was engineered (pjT1224) to 
contain an intact calD, but truncated ca/C, calC\ and calC" genes. This subclone was hot 
able to confer resistance to calicheamicin. Next, a subclone containing the calC region 
was constructed (pjT1232). This clone conferred calicheamicin resistance. See above 
chart. Subclones containing calC (pAP6) and calC" (pREl) were constructed and tested 
for calicheamicin resistance. These clones could not confer resistance to calicheamicin. 
See above chart. 
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To ascertain the amino acid sequence of CalC and learn its properties. calC was 
cloned into a pMAL-C2 vector. (pMAL-C2 by itself could not confer calicheamicin 
resistance. See above chart.) The resulting plasmid. pRE7, which contained calc, 
conferred resistance to calicheamicin. See above chart.. Plasmid pRE7 was then induced 
with {sopropyLBeta-D-thiogalactoside CTPTG") to overexpress CalC. Induced pRE7 
conferred resistance to calicheamicin and produced a maltose-binding protein CalC fusion 
protein (mbp-CaiC). . Thi;^resultirig overexpression of CalC increased calicheamicin 
resistance 1 0>fo Id in vrvu. See above chart. v !■ '■:< • ' - 

Example 3: Expression of protein CalC 

. The protein mbp-CalC was overexpressed and purified for further analysis. The 
mbp-rCalC was purified from pRE7/£. coli xo homogeneity as judged by SDS-PAGE. An 
overnight LB culture (containing 50 mg ml" 1 ampicillin and 50 ng ml"' calicheamicin from 
a fresh pRE7/E. coli colony was grown at 37 °C, 250 rpm to an A 600 =0.5, induced with 0.5 
mM IPTG and growth continued overnight. Cells were harvested (4,000 x g, 4 °C ? 20 
minutes), resuspended in buffer A (50mM Tris-Cl/pH .7.5; 200 mM NaCl, ImM EDTA) 
and disrupted by sonication. The cell debris was removed by centrifugation (5,G00xg, 
4°C, 20 minutes). The supernatant was applied to an amy lose affinity column ( 1.5 x 7.0 
cm, 1 rnL min" 1 ). The desired mbp-CalC protein was eluted with buffer A containing 10 
mM maltose. The eluate was concentrated and chromatographed on an S-300 column 
(50mM TrisrCl, pH 7.5, 200 mM NaCl). Active fractions were used immediately or 
frozen at -80° C for storage. 



28 



WO 00/37608 



PCT/US99/29110 



Example 4: Analysis of Protein CalC 

The purified mbp-CalC was then analyzed- for metal content. Purified mbp-CalC 
displayed a yellow color in concentrated form and subsequent- metal analysis, using 
inductively coupled plasma atomic mass spectrometry O'ICP-MS"), revealed the presence 
of iron (.Fe.). Determination of the Fe stoichiometi^Lacdomplished in conjunction with 
quantitative amino acid hydrolysis, indicated 2l23 ± ;0;3 mol Fe penmol mbp-GalC (based 
.upon the monomelic molecular:weight of^S.SMdaiton'Calculated^from the known 
nucleotide sequence of the mbp-cr//C gene fusion; which is consistent, with the determined 
subunit molecular weight determined by SDS-PAGE). The precise mbp-CalC 
concentration was determined by quantitative amino acid hydrolysis by the Rockefeller 
University Protein/DNA Technology Center/ Trace metal content of an aliquot of the 
hydrolysate was subsequently determined by TCP-MS on four distinct mbp-CalC 
preparations with buffer alone and/or maltose-binding protein alone analyzed in parallel as 
controls. These results were independently confirmed by methodologies used for 
spectrophotometric iron determination. Eish,;W.lW.; Afef/?, EnzymdL 1988, 158, 357-364. 
The electronic absorption spectrum of mbp-CalC is shown in Figure 4. In addition, to the 
A 2S o protein absorbance (6 2S0 = 99.300 M'l cm; I), a clear absorbance maxima at 41 1 nm 
(6^ n = 6,000 M" 1 cm' 1 ) can be observed. Electron para magnetic resonance ("EPR") was 
performed to ascertain the metal content of CalC,- The X-bandlEPR measurements on the 
oxidized CalC proteins exposed a; standard rhombic EPR signal at g = 43 (E/D = 
0.33)(Figure 4„ inset). The metal content was 90 £ 10 jaM Fe (approximately 72 ± 10% of 
total iron as seen by ICP-MS, Figure 4. The spectroscopic evidence indicates the presence 
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of a mononuclear Fe" 3 center in CalC is consistent with the lack of cysteins in the primary 
sequence of CalC See Palmer. G.. Biochem. Soc. Trans. 1985,13,548-560. 

ExampIeS: Verification of CalC*s calicheamicin resistance 
5 Given that calicheamicin leads to double strand DNA cleavage and CalC provides 

calicheamicin-resistance in vivo, it was expected that the addition of CalC to an in vitro 
calicheamicin-induced DNA cleavage assay would inhibit DNA cleavage. To test this 
theory, preliminary assays were performed with supercoiled pBlusecript plasmid DNA 
O'pBS") as the template, and dithiothreitol ( fci DTT*) as the reductive initiator. In a typical 

10 assay, purified mbp-CalC (15.0 nM) and 30.0 nM calicheamicin were preincubated for 15 

min. in a total volume of 25 |al 40 mM Tris-Cl, pH 7.5, at 37 °C. Then 2.5 jil lOmM DTT 
stock solution was added to the assay solution, and the assay was incubated an additional 1 
hour at 37°C. DNA fragmentation was. assessed by electrophoresis on a 1% agarose gel 
stained with ethidium bromide. Using this assay, it was found that mbp-CalC could 

15 completely inhibit calicheamicin-induced DNA cleavage at concentrations nearing 10 3 - 

fold excess of calicheamicin. Preincubation of mbp-CalC and DTT, protein removal via 
forced dialysis, and the subsequent use of the DTT solution as reductant did not noticeably 
affect the amount of DNA cleavage. 

As indicated in Figure 4(b), no DNA cleavage was observed in the absence of DTT 

20 or calicheamicin (lanes a and b), while efficient cleavage was demonstrated in the presence 

of DTT and calicheamicin (lane c). As expected, the addition of mbp-CalC completely 
inhibited calicheamicin-induced DNA cleavage (lane f) while the addition of mbp alone 
(lane d) as a control, failed to inhibit calicheamicin-induced DNA cleavage. Furthermore, 
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preincubation of mbp-CalC with DTT (not shown), or <7/;<7-mbp-CalC (lacking the Fe 
cofactor)(lane e), also failed to inhibit calicheamicin-induced DNA cleavage. However, 
the addition of Fe " 2 or Fe" 3 to the apo-mbp-CalC assay could reconstitute CalC activity 
(lane g). Reconstitution of t/po-mbp-CalC was accomplished by preincubation with 1 mM 
5 FeS0 4 (Fe -) or'TFeCl 3 (Fe J3 ) prior to the activity assay as previously described. 

Example 6: Production of methymycin/pikromycin-calicheamicin hybrid compounds 

The 1:2 kb c6lH g£n£ was amplified 1 fcy : pofymerase 'chain reaction (PCR) from 
pJSTl 192 kpn7 . which is a subclone containing a f.'O'kb Kpn\ fragment of cosmid 13a. The 

10 amplified gene was cloned into the EcbKVXbcA site of the expression vector pDHS61 7. 

This expression vector contains an apfamycin'resistance marker. The plasmid pDHS617 
was derived from pOJI446 (BiermarL M. et al.i Gene 1992, 1 16, 43-49). A promoter 
sequence from the S. Venezuela methymycin/pikromycin cl^ in the 

plasmid to drive the expression of foreign genes in S. Venezuela. The resulting plasmid, 

15 pLZ-C242 (containing the cafht gene insert and the prbmoter sequence) was introduced by 

conjugal transfer using E.coli S 17-1 into a prdvloiisly constructed 5*. Venezuela mutant, 
desl. (Borisova, S. et al.' Org. Lett. 1999: 1 .133-136): In the Desl mutant, the desl was 
replaced by the neomycin resistance gene, which" confers resistance to kanamycin The 
PLS-C242-contaihing 5*. venezuela-Dzsl do forties were identified on the basis of their 

20 resistance to apramycin antibiotid. One of these positive colonies, DesI/calH-1 was grown 

in 100 ml of seed medium at 29°C for 48 hours aiicl then inoculated and grown in five 
Liters of vegetative medium. Cane, D.E., et at:: J Am: Chem. Soc, 1 993, 1 15, 522-526. 
The culture was centrifuged to remove cellular debris ahd mycella. The supernatant was 
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adjusted to pH 9.5 with concentrated KOH. followed by chloroform extraction. The crude 
products (700 mg) were subjected to flash chromatography on silica gel using a gradient of 
1-20% methanol in chloroform. A major product, 10-deoxymethynolide (ca. 400 mg), and 
a mixture of two minor macro! ide compounds were obtained. The two macrolides were 
further purified by HPLC on a G lg .column using an isocratic mobile phase of 
acetonitrile/H 2 0 (1 ; 1 ). ,They were later identified as compound (11) and compound 
(12)(figure 7) by spectra) anaylses. 
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Sequence listing 1 —calC gene: 

atgactcaggagaagaccgcaccggccgcgaagagcacgaccaccaagagca 

ccgccgcgaagaagccgaagcccccgaactacgacccgttcgtccggcacag 

cgtcactgtcaaggccgaccgcaagaccgccttcaagAcgttcctcgaaggct 

5 TTCCGGAGYGGfGGlCC^ 

GGCGTCGACAAGAAGGGCGG(X 

AGCACACCTTCGGCCTG ATCCGG AAGdf GGACGAGCCGG ACACGCTGGTC ATC 
GGCTGGCGGCTCAACGGCTTCGGCCGGATCGACCCGGACAACTCGAGCGAGTT 
CACCGTGACCTTCGTGGCCGACGGCCAGAAGAAGACCCGGGTGGACGTCGAG 
1 0 CACACCCACTTCGACCGGATGGGCACCAAGCACGCCAAGCGGGTCCGCAACG 
GCATGGACAAGGGCTGGCCGACGATCCTCCAGTCGTTCCAGGACAAGATCGAC 
GAGGAAGGGGCGAAGAAGTGA 
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Sequence Listing 2 -CalC protein: 

(Note that in protein sequences amino acids are designated in one-letter code) 
MTQEKTAPAAKSTTTKSTAAKKPKPPNYDPFVRHSVTVKADRKTAFKTFLEGFPE 
WWPNNFRTTKVGAPLGVDKKGGRWYEIDEQGEEHTFGLIRKVDEPDTLVIGWRL 
5 NGFGRIDPDNSSEFTyTFVADGQKKTRVDVEHTHFDRMGTKHAKRVRNGMDKG 
WPTILQSFQDKIDEEGAKK 
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Sequence Listing 3 calH gene: 

GTGGCAACTAGCGAGAGGGGTGTCATGATCCCGCTGTCCAAGGTCGCCATGTC 
TCCGGACGTCAGCACeCGCGTCTCCGCCGtcetGAGCAGTGGCCGGCTGGAGC 
ACGGGCCGACCGTCGCCGAGTACGAGG'eGGt'CGTGGGCAGTCGTATCGGCAA 
5 CCCCCGGGTGGTCTCGGTeAA^tGeGGC^CG^ 

GCCTCGCCGCGCGGCCGGGGGCCGGCGAGfcGGAGCACGACGGCCCGGGCGA 
GGTGCTCACCACGCCGCTGACCTTCGAGGGCACGAACTGGCCGATCCTCGCCA 
ACGGGCTGCGCATCCGGTGGGTGGACGTCGACCCGGCCACCCTCAACATGGAC 
CTCGACGACCTGGCCGCGAAGATCTCGCCCGCCACCCGGGCCATCGTGGTGGT 
1 0 CCACTGGCTCGGCTACCCGGTGG ACCTCAACCGGCTGCGCGCCGTCGTGGACC 

GGGCCACGGCGGGATACGACCGCCGCCCGCTGGTCGTGGAGGACTGCGCGCA 
GGCGTGGGGCGCCACCTACCGGGGCGCGCCGCTGGGCACGCACGGCAACGTC 
TGCGTGTACAGCACCGGCGCGATCAAGATCCTGACGACCGGCAGCGGCGGCTT 
CGTCGTGCTGCCCGACGACGACCTGTACGACCGGCTCCGGCTGCGCCGCTGGC 
1 5 TCGGCATCGAGCGGGCGTCGGACCGGATCACCGGCGACTACGACGTCGCCGA 
GTGGGGCTACCGGTTCATCCTCAACGAGATCGGCGGGGCGATCGGCCTGTCCA 
ACCTGGAACGCGTCGACGAGCTGCTGCGCCGGCACCGGGAGAACGCCGCGTT 
CTACGACAAGGAACTGGCCGGCATCGACGGCGTCGAGCAGACCGAGCGGGCC 
GACGACCGGGAGCCCGCGTTCTGGATGTACCCGCTGAAGGTCCGCGACCGTCC 
2 0 CGCCTTCATGCGCCGGCTGCTCGACGCCGGCATCGCCACCAGCGTCGTGTCGC 
GCCGCAACGACGCGCACAGCTGCGTCGCGTCGGCCCGCACCACCCTGCCCGGG 
CTGGACCGGGTGGCGGACCGCGTGGTCCACATCCCGGTGGGCTGGTGGCTCAC 
CGAGGACGACCGCTCCCACGTCGTCGAAACGATCAAGTCCGGCTGGTGA 
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Sequence Listing 4 —CalH protein: 

MATSERGVMIPLSKVAMSPDVSTRVSAVLSSGRLEHGPTVAEYEAAVGSRIGNPR 
VVSVNCGTAGLHLALSLAARPGAGESEHDGPGEVLTTPLTFEGTNWPILANGLRIR 
WVDVDPATLNMDLDDLAAKISPATRAIVVVHWLGYPVDLNRLRAVVDRATAGY 
5 DRRJLVVEDCAQAV/GATYRGAPLGTHGNYCVYSTGAIKILTrGSGGFVVLPDDD 
LYDRi.RLRRM / LGIERASDRJTGDYDVAEVVGYRFILNEIGGAIG7.SNLERVDELLRR 
HRENAAFYDKF1AG!DGYEQTE1RADDREP,AFWMYPLKVRDRPAFMRRI,LDAGIA 
TS VVSRRNDAHSCVASARTirLPGLDRyA 
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Sequence Listing 5 —calG gene: 

GTGCCCAGATCCCTGGTCACCGGCGGCTTCGGCTTCGTCGGCAGTCACGTCGT 

CGAACGGCTGGTCCGCCGGGGTGACGAGGTCGTCGTCTACGACCTCGCCGACC 

CGCCGCCCGACCTGGAGCACCCGCCGGGCGCGATCCGGCACGTCCGCGGCGA 




ATCCGCGGGTGCCGTGGCGGGAGGACGACGACCGGGTGCTCGGCAGCACGGC 
10 GACGGACCGGTGGTCGTACTCGACGAGCAAGGCGGCGGCCGAGCACCTGGCC 
TTCGCCTTCCACCGGCAGGAGGGCCTGCCGGTGACGGTGCTGCGGTACTTCAA 
CGTCTACGGCCCACGCCAGCGCCCGGCGTACGTCCTCAGCCGCACCGTCGCCC 
GCCTGCTGCGGGGCGTTCCGCCCGTGGTGTACGACGACGGCCGCCAGACGCGG 
TGCTTCACCTGGATCGACGAGGCGGCCGAGGCGACCCTGCTGGCCGCCGCCCA 
1 5 CCCGCGGGCCGTCGGCG AGTGTTTC AACATCGGC AGC AGCGTGG AG ACC ACC 

GTCGCCGAGGCGGTCCGGCTGGCCGGCACGGTGGCCGGGGTGCCGGTGGCGG 
CCCAGACCGCGGACACCGGAGCCGGGCTCGGCGCCCGCTACCAGGACATTCC 
CCGCCGCGTACCGGACTGCGGCAAGGCCGCCGCGCTGCTGGACTGGCGGGCC 
CGGGTGCCGCTGGTGACCGGCCTGCGCCGGACCGTCGAGTGGGCCCGCCGCA 
20 ACCCGTGGTGGACCGCCCAGGCCGACGACGGACTGGTCGTCAGGTAG 
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Sequence Listing 6 - CalG protein: 

MPRSLVTGGFGFVGSHVVERLVRRGDEVVVYDLADPPPDLEHPPGAIRHVRGDV 
RDADGLAAAATGVDEVYHLAAVVGVDRYLSRPLDVVEINVDGTRNALRAALRA 
GARVVVSSTSEVYGRNPRVPWREDDDRyLGSTATDRWSYSTSKAAAEHLAFAFH 
5 RQEGLPVTVLRYFNVYGPRQRPAYVLSRTVARLLRGVPPVVYDDGRQTRCFTWI 
DEAAEATLLAAAHPP^VGECFNIGSSVETTVAEAVRLAGTVAGVPVAAQTADTG 
AGLGARYQDIPRRVPDCC^A/ALLpWRARyPLVTGLRRTVEWARRNPWWTAQ 
ADDGLVVR .,>,;■ ^ . 
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CLAIMS 

1 . An isolated nucleic acid moleeulfe frdhi MicromOnospora echinospora comprising 
a nucleic acid seqUehce encoding for a geiie from a nonchromoprotein enediyne 
biosynthetic gene cluster, the proteih coding region of said gene or k biologically 
active fragnient 6f said gene! ; v/ ! CiO>rI . ^ 

2. The isolated riuclfeic acid molecule 6t Claim 'f; wherdin said gene is calA. calB, 
ca/Cy^/D, itiWcMicalQ eait, calWctifti^ calO, calP, 
calQ. calR, calS, caTY, or/1, or/2, orfi* or/4, or/5, or/6, or/7?or an IS-element 
gene. 

3. The isolated nucleic acid molecule of Claim 1 . wherein said molecule encodes two 
or more of said genes. 

4. The isolated nucleic acid molecule of Claim 1 , wherein said molecule encodes the 
full biosynthetic gene cluster 

5. The isolated nucleic acid molecule of Claim 1, wherein said nonchromoprotein 
enediyne is calicheamicin. 

6. An isolated nucleic acid molecule capable of hybridizing with a nucleic acid from 
Micromonospora echinospora spp. calichensis encoding for one or more genes 
from a nonchromoprotein enediyne biosynthetic gene cluster. 

7. The isolated nucleic acid molecule of Claim 6, wherein said molecule encodes a 
protein having the activity of at least one gene from the biosynthetic gene cluster. 

8. The isolated nucleic acid molecule of Claim 6, wherein said gene is calA, ca/B, 
calC, calD, calE, calF, ca/G, calH, call, calJ, calK, calL, calM, ca/N, calO, calP, 
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ca/Q, calR, ca/S. call, or/\^. or/2, or/3, or/4, or/5, or/6* or/1 \ or an IS-element 
gene. 

9. The isolated nucleic acid molecule of Claim 1. comprising SEQ ID No.l. 

10. The isolated nucleic acid molecule of Claim 1, comprising SEQ ID No.3. 
5 11. The isolated nucleic acid molecule of Claim 1, comprising SEQ ID No.5. 

12. The isolated nucleic acid molecule of Claim 1, which encodes a polypeptide 
encoding for a P^ 0 oxidase from Micromonospora echinospora spp. calichensis. 

13. The isolated nucleic acid molecule of Claim 1, which encodes a polypeptide 
encoding for a membrane transporter from a gene cluster of Micromonospora 

10 echinospora spp. calichensis coding for calicheamicin biosynthesis. 

14. The isolated nucleic acid molecule of Claim 1 ? which encodes a polypeptide 
encoding for an O-methyltransferase from a gene cluster of Micromonospora 
echinospora spp. calichensis, coding for calicheamicin biosynthesis. 

15. The isolated nucleic acid molecule of Claim 1, which encodes a polypeptide 
15 encoding for a glycosyltransferase from a gene cluster of Micromonospora 

echinospora spp. calichensis coding for calicheamicin biosynthesis. 

16. The isolated nucleic acid molecule of Claim 1, which encodes a polypeptide 
encoding for a 7V,//-dimethyltransferase from a gene cluster of Micromonospora 
echinospora spp. calichensis coding for calicheamicin biosynthesis. 

20 17. The isolated nucleic acid molecule of Claim 1 , which encodes a polypeptide 

encoding for a dipeptide transporter from a gene cluster of Micromonospora 
echinospora spp. calichensis coding for calicheamicin biosynthesis. 
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18. The isolated nucleic acid molecule of Claim 1. which encodes a polypeptide 
encoding for a L-cysteine/cystine C-S-lyase from a gene cluster of 
Micromonospora echinospora spp, calichensis coding for calicheamicin 
biosynthesis. 

5 19. The isolated niicfeic acid molecule of Claim 1*, winch encodes a polypeptide 
encoding for an oligopeptide transporter : protein from a gene cluster of 
Micronionospdra Echinospora spp. cdltchehstsc6d\ng for calicheamicin 
■ biosyrithesfsl ' 1 " 

20. s The isolated nucleic acid molecule of Claim 1. which encodes a polypeptide 
1 0 encoding for a regulator)' protein from a gene cluster of Micromonospora 

echinospdra spp, calichensis coding for calicheamicin biosynthesis. 

21. The isolated nucleic acid molecule of Claim 1, which encodes a polypeptide 
encoding for a hexopyranosy 1-2-3 -reductase from Micromonospora echinospora 
spp, calichensis, 

15 22. The isolated nucleic acid molecule of Clairri i /which "encodes a polypeptide 

encoding for a desaturase from a gene cluster of Micromonospora echinospora spp. 
calichensis coding for calicheamicin biosynthesis. 

23. The isolated nucleic acid molecule of Claim 1 , which encodes a polypeptide 
encoding for an UDP-D-glucose 6-dehydrogehase from Micromonospora 

20 echinospora spp . calichensis! ~ yt ■-•■•■' 4U 

24. The isolated nucleic acid molecule of Clidihi 1 , wTiich encodes a polypeptide 
encoding for a transcriptional regulator from a gene cluster of Micromonospora 
echinospora spp. calichensis coding for calicheamicin biosynthesis. 
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25. An expression vector comprising a nucleic acid molecule encoding a protein 
coding sequence, wherein the nucleic acid molecule is selected from any of Claims 
1 through 24, \ 

26. The expression vector of Claim 25, wherein said nucleic acid molecule is operably 
5 linked to regulatory sequences to control expression of said protein. 

27. The expression vector of Claim 26. wherein the regulatory sequence is a 
Streptomyces ; promoter: ,\= - " 

28. A host cell transformed vsWth a nucleic acid rnoleciiie from any one of Claims 1 
through 24. 

10 29. A host cell transformed with, a nucleic acid molecule from Claim 25. 

30. A host cell transformed with a nucleic acid molecule from Claim 26. 

31 . The host cell of Claim 28, wherein said host cell is a bacterium, yeast, insect, plant, 
fungi, or mammalian cell. 

32. The host cell of Claim 28. wherein the host bacteria is £. coli or Streptomyces. 
15 33. A cosmid comprising an isolated nucleic acid molecule from Micromonospora 

echinospora spp. calichensis, comprising a nucleic acid sequence encoding for a 
nonchromoprotein enediyne biosynthetic gene cluster. 

34. The cosmid of Claim 33, wherein said sequence encodes ca/A, calB* calC, ca/D, 
ca/E, ca/F, calG, ca/H, call, call, calK, ca/L, calM, ca/N, calO, calP, calQ, caIR, 

20 calS, calT, orfi, or/2, or/3, or/4, or/5, or/6, or/7, and an IS-element gene. 

35. A method of expressing a protein comprising culturing a host cell with an 
expression vector from Claim 25 for and incubating under time and conditions that 
allow for protein expression. 
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36. The method of Claim 35. wherein said host cell is a bacterium, yeast insect, plant, 
fungi, or mammalian celh 

37. A method of purifying calicheamicin using affinity chromatography, comprising 
providing a solution containing calicheamicin. to an affinity column having CalC 

5 bound thereto, , and recovering ealicheamiciini!'). *v ■ 

38. A polypeptide comprising an amino acid -'Sequence SEQ ID. No.: 2. 

39. A polypeptide comprising an amino acid sequence- SEQ ID- No.: 4. 

40. . A polypeptide comprising an amino acid sequdnee^SEQ ID. No,: 6. 

41 . A method of conferring calicheamicin resistance to a subject comprising obtaining 
10 cells from the subject, transforming the cells with the calicheamicin self resistance 

gene and returning the cells to the subject. 

42. A compound having the structure: 

o 

Me 

13 

O 



15 




// 

43 . A compound having the structure : 

o 



20 
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Summary of cosmid clones isolated from M. echinospora genomic library. 



clone" 


type I PKS genes* 


type II PKS genes* 


cieo.xy sugar genes* 


resistance 
(ngmL-') f 


3a 


N.D/ 


N.D/ 


N.D/ 


0.5 


4a 


N.D/ 


N.D/ • 


N.D/ 


0.5 


4b 


+ 


' +' 




0.5 


10a 


+ 


+ 


+ 


0.5 


•13a 


+ 


•f 


+ ' 


0.5 


16a 


N.D/ 


N.D/ 


N.D/ 


0.5 


56 


+ 




+ 


0.1 


58 






+ 


<0.01 


60 


+ 




+ 


0.05 


66 






+ 


0.04 
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putative number of 
polypeptide amino acids 



proposed function or sequence similarity 
detected* 



probability 6 



start/stop 
codons 



best match 



CalA 328 membrane transporter (ATP-binding) 

CalB 561 membrane transporter 

CalC 181 caJicheamicin resistance protein 

CalD 263 O-methyitransferase 

CalE 420 gly cosy 1 transferase 

CalF 245 AW-dimethvltransferase 

CalG 990 TDP-D-giucose4.6-dehydratase 

CalH 338 perpsairine synthetase 

Call 568 : dipss^de transporter 

Call 332 0-mfnhytohsferase 

CaiK 440 ; . L-cysn= or/cysaiis C-S-lyase '' 

CaiL 562 oiigopeuude:i»:\isponer protein 

CalM 416 regrjlaiory pzofzem 

CaiN 398 glydc^jO transferase 

CalO 331 hexopvrar.osyl -2.3 reductase 

CalP (179)* ' desaturase 

CalQ 453 LTDPrD-giucose 6-dehydrogenase 

CalR 265 reeutatorv protein 

CalS 1113 ~P 4J0 oxidase 

CalT 280 transcripiional regulator 

orfl 322 unknown 

orfl 654 unknown 

orfh 209 unknown 

orfA 521 unknown 

orfi 175 unknown 

orfS 139 unknown 

orfl 187 unknown 

IS-eiement 1209 bp insertional element 



5.4xI0* ,M 
5.5x1 0' 70 
confirmed' 1 
1,1x10-" 
4.7xl0* w 
1.5xl0' 78 



ATG/TGA 
ATG/TGA 
ATG/TGA 
ATG/TGA 
GTG/TAG 
ATG/TGA 



confirmed" GTG/TAG 
confirmed' GTG/TGA 



l:7xl0' M 
1.0x10*" 
1.6xl0* M 
9.5X10' 14 

3.4xi0' 79 
4.9x10'" 
5.7xl0- ? 
6.7xl0* 74 
3.7xlO' n 
2.9x10 66 
1.7xl0' u 



GTG/TGA 
ATG/TGA 
GTG/TGA 
ATG/TGA 
GTG/TGA 
ATG/TGA 
ATG/TGA 
/TGA 
GTG/TGA 
ATG/TGA 
GTG/TGA 
ATG/TGA 
ATG/TGA 
ATG/TGA 
ATG/TGA 
GTG/TAA 
ATG/TGA 
ATG/TGA 
GTG/TGA 



DrrA 97 
DrrB 97 
section C5-2. 
AveBVII* 
ErvCH" 
DesVI 100 

section CA.3. 

section C.4A. 
DciAE ,0! 
DrnpM lc: 
C-DES 10 - 1 
OppA 101 

Olcl ,os 
EryBII* 

cm 106 

gene 836' 07 
KorSA ,GS 
Biol' 09 
SC5C7.03 ,,L ' 



5.7x10' 



ISJ136' 11 
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